A Flexible Interface Tool for Manual Word Sense Annotation
نویسندگان
چکیده
This paper introduces LX-SenseAnnotator, a user-friendly interface tool for manual word sense annotation. The demonstration will show how input texts are loaded by the tool, the options available to the annotator for displaying and browsing texts, and how word senses are displayed and manually assigned. The flexibility of LX-SenseAnnotator, including the support of a variety of languages and the handling of pre-processed texts with different tagsets, will also be addressed.
منابع مشابه
MaJo - A Toolkit for Supervised Word Sense Disambiguation and Active Learning
We present MaJo, a toolkit for supervisedWord Sense Disambiguation (WSD), with an interface for Active Learning. Our toolkit combines a flexible plugin architecture which can easily be extended, with a graphical user interface which guides the user through the learning process. MaJo integrates offthe-shelf NLP tools like POS taggers, treebank-trained statistical parsers, as well as linguistic r...
متن کاملInforex - a web-based tool for text corpus management and semantic annotation
The aim of this paper is to present a system for semantic text annotation called Inforex. Inforex is a web-based system designed for managing and annotating text corpora on the semantic level including annotation of Named Entities (NE), anaphora, Word Sense Disambiguation (WSD) and relations between named entities. The system also supports manual text clean-up and automatic text pre-processing ...
متن کاملbrat: a Web-based Tool for NLP-Assisted Text Annotation
We introduce the brat rapid annotation tool (BRAT), an intuitive web-based tool for text annotation supported by Natural Language Processing (NLP) technology. BRAT has been developed for rich structured annotation for a variety of NLP tasks and aims to support manual curation efforts and increase annotator productivity using NLP techniques. We discuss several case studies of real-world annotati...
متن کاملIMI -- A Multilingual Semantic Annotation Environment
Semantic annotated parallel corpora, though rare, play an increasingly important role in natural language processing. These corpora provide valuable data for computational tasks like sense-based machine translation and word sense disambiguation, but also to contrastive linguistics and translation studies. In this paper we present the ongoing development of a web-based corpus semantic annotation...
متن کاملTowards a Computational Model of Gradience in Word Sense
In the construction of lexicons or word sense inventories, researchers have usually strived to de ne a set of disjoint senses for a word; however, there is increasing evidence that this is impossible in the general case. In this programmatic paper, we support this point with sense overlap data from manual sense annotation. We then sketch the road towards a fundamentally graded representation of...
متن کامل